Genetics in Medicine
○ Elsevier BV
Preprints posted in the last 30 days, ranked by how well they match Genetics in Medicine's content profile, based on 69 papers previously published here. The average preprint has a 0.09% match score for this journal, so anything above that is already an above-average fit.
Mavura, Y.; Crosslin, D.; Ferar, K. D.; Lawlor, J. M.; Greally, J. M.; Hindorff, L.; Jarvik, G. P.; Kalla, S.; Koenig, B. A.; Kvale, M.; Kwok, P.-Y.; Norton, M.; Plon, S. E.; Powell, B. C.; Slavotinek, A.; Thompson, M. L.; Popejoy, A. B.; Kenny, E. E.; Risch, N.
Show abstract
PurposeDiagnostic yield from exome and genome sequencing varies widely across studies. It remains unclear how much of this variation reflects patient-level factors (e.g., sex, clinical features, race/ethnicity, genetic ancestry) versus site-level practices such as sequencing modality or variant interpretation workflows. We aimed to quantify the contributions of these factors to diagnostic outcomes across five U.S. clinical sequencing sites. MethodsWe performed a cross-sectional analysis of 3,008 prenatal, neonatal, and pediatric cases from the NHGRI Clinical Sequencing Evidence-Generating Research (CSER) consortium (2017-2023). Clinical indications spanned neurodevelopmental, neurological, immunological, metabolic, craniofacial, skeletal, cardiac, prenatal, and oncologic presentations. Genetic ancestry was inferred from sequencing data, and variants were interpreted using ACMG/AMP guidelines to classify DNA-based diagnoses. Generalized linear mixed models were used to estimate associations between diagnostic yield and fixed effects (sex, prenatal status, isolated cancer, number of clinical indications, sequencing modality, race/ethnicity, and genetic ancestry), while modeling study site as a random effect to quantify between-site variation. ResultsThe overall diagnostic yield was 19.0%. Multiple clinical indications (OR=1.47, 95% CI 1.20-1.80, p<0.001) were associated with higher diagnostic yield, and male sex (OR=0.80, 95% CI 0.66-0.96, p=0.017) and prenatal status (OR=0.63, 95% CI 0.44-0.90, p=0.012) were associated with lower yield. Sequencing modality, race/ethnicity, genetic ancestry, and isolated cancer were not statistically significantly associated with diagnostic outcomes.. A model without fixed effects attributed [~]10% of variance in diagnostic yield to between-site differences. After adjusting for covariates, site-level variance decreased to 5.7%, indicating consistent variation across sites not explained by measured patient factors. ConclusionAcross five sites, patient-level clinical features influenced diagnostic yield, but substantial site-level variation remained even after adjustment. Differences in variant interpretation, or case-classification practices may contribute to this residual variability. Further efforts to increase consistency in exome- and genome-sequencing diagnostic workflows may help reduce inter-site differences.
Meng, M.; Liu, L.; Du, Q.; Zhou, X.; Tian, Y.; Sun, K.; Li, N.; Zhang, P.; Lian, X.; Fan, N.; Zhu, N.; Li, S.; Mao, A.; Li, Y.; Zou, G.
Show abstract
Background: Artificial intelligence (AI)-driven variant prioritization has demonstrated substantial utility in expediting genetic diagnosis by ranking the most likely causative variants. While a variety of tools have been developed, few address the unique clinical and technical constraints in prenatal genetic diagnosis. Methods: We introduce Berrylyzer, a novel, end-to-end variant prioritization system applied to prenatal diagnosis.Inspired by clinician's reasoning process during variant interpretation, Berrylyzer applies a modular, stepwise scoring architecture that jointly integrates phenotypic and genomic evidence and delivers a ranked list of candidate variants, achieving high computational efficiency without compromising analytical rigor. Moreover, Berrylyzer natively supports both structured ontologies and free-text clinical narratives, enabling flexible integration into diverse clinical environments. Its performance was rigorously evaluated across two independent, real-world prenatal cohorts and benchmarked against three state-of-the-art methods: Xrare, Exomiser, and PhenIX. Results: Across the two datasets, Berrylyzer ranked 56.41% and 58.12% of diagnostic variants first, and achieved recall rates of 94.02% and 97.42% within top 20, respectively. Berrylyzer outperformed Xrare (85.19% and 87.08%), Exomiser (84.90% and 85.98%), and PhenIX (82.05% and 88.93%). Stratified analysis consistently demonstrated superior performance across diverse disease categories, inheritance patterns, and analytical strategies. Notably, Berrylyzer exhibited robustness regardless of phenotype forms, yielding comparable top 20 recall rates for free-text descriptions and standardized terminologies. Conclusion: Berrylyzer represents an accurate, interpretable, and computationally lightweight variant prioritization system for prenatal genetic diagnosis. The superior performance across heterogeneous diagnostic contexts enables it as a practical solution for seamless integration into clinical pipelines, thereby advancing precision medicine in prenatal settings.
Stewart, D.; Kim, J.; Haley, J. S.; Li, J.; Sargen, M. R.; Hong, H. G.; Tischkowitz, M.; McReynolds, L. J.; Carey, D. J.
Show abstract
PURPOSE To evaluate cancer risk, age-specific penetrance, and mortality associated with heterozygous pathogenic or likely pathogenic (P/LP) germline PALB2 variants identified through genomic ascertainment and to assess modification by family history of cancer. PATIENTS AND METHODS We conducted a case-control study in two large population-based adult cohorts: the UK Biobank (n=469,580) and Geisinger MyCode (n=167,050). Individuals with heterozygous PALB2 P/LP variants were identified via exome sequencing and compared with non-carriers. Cancer diagnoses and vital status were obtained from linked registry and electronic health record data. We used multivariable logistic regression to estimate odds ratios (ORs) for cancer outcomes and Cox proportional hazards models to estimate hazard ratios (HRs) for all-cause mortality. Age-specific cumulative incidence (penetrance) was estimated using Kaplan-Meier methods. Models were adjusted for birth year, sex (when applicable), smoking status, and body mass index; stratified analyses assessed modification by family history of cancer. RESULTS PALB2 P/LP variant prevalence was 1:571 in UK Biobank and 1:940 in MyCode, with the higher prevalence in the UK cohort driven by the PALB2 p.Trp1038Ter founder variant. Compared with non-carriers, heterozygotes had significantly increased odds of any cancer, female breast cancer, pancreatic cancer, and cancers of ill-defined or secondary sites in both cohorts (P < 0.01). Adjusted hazard ratios for any cancer and female breast cancer ranged from 1.7 to 3.6. All-cause mortality was increased among PALB2-heterozygotes (HR 1.61-1.67), and survival after cancer diagnosis was reduced. Family history further modified cancer risk. CONCLUSION Genomic ascertainment of PALB2-heterozygotes identifies elevated risk for multiple cancers and increased mortality, although risks were lower than estimates from familial ascertainment. These findings inform risk management for individuals identified through genomic screening.
Nguyen, M.-H.; Yang, C.-T.; Cassini, T. A.; Ma, F.; Hamid, R.; Bastarache, L.; Peterson, J. F.; Xu, H.; Li, L.; Ma, S.; Shyr, C.
Show abstract
Background: Large language models (LLMs) have been evaluated as tools to assist rare disease diagnosis, yet evidence on their accuracy remains fragmented. We conducted a systematic review and meta-analysis to synthesize the available evidence on the diagnostic performance of LLMs, identify sources of heterogeneity, and evaluate the current evidence base for clinical translation. Methods: We searched PubMed, Embase, Web of Science, Cochrane Library, arXiv, and medRxiv (January 2020-February 2026). Full-text articles and preprints were considered for inclusion. Eligible studies applied LLM-based systems to generate differential diagnoses for rare diseases and provided Recall@1 (R@1; proportion with the correct diagnosis ranked first). We pooled R@1 using Freeman-Tukey double arcsine transformation with DerSimonian-Laird random-effects models. Pre-specified subgroup analyses examined LLM knowledge augmentation strategy and input modality. Because both retained high residual heterogeneity, we conducted a post-hoc exploratory analysis of evaluation benchmark disease composition, mapping diseases from major benchmarks to Orphanet prevalence classifications. Risk of bias was assessed using a modified QUADAS-3 instrument. Findings: We identified 902 records, of which 564 were screened and 15 studies were eligible. These 15 studies contributed 19 system-dataset entries to the meta-analysis (total N=39,529 cases). The pooled R@1 was 43.3% (95% CI 35.1-51.6; I2=99.6%). Augmented LLM systems (agent-based reasoning, retrieval, or fine-tuning; k=8) achieved R@1 of 52.5% (42.0-62.9) versus 35.4% (30.6-40.4) for standalone LLMs (k=11; p=0.004). Post-hoc exploratory analysis indicated that evaluation benchmark disease composition was associated with differences in diagnostic performance: R@1 was lower on the Phenopacket Store dataset, which contained a higher proportion of ultra-rare diseases (52.8%; k=2), than on RareBench (29.3%; k=6) at 21.7% (18.2-25.5) versus 52.0% (40.7-63.2; p<0.001). All 19 system-dataset entries were assessed to be at high risk of bias, most commonly due to potential data leakage and limited reproducibility. No study provided prospective clinical validation. Interpretation: Diagnostic performance of LLM-based systems for rare diseases varied substantially across evaluation benchmarks. Post-hoc exploratory analysis indicated that performance was associated with benchmark disease composition. Performance was higher in benchmarks containing fewer ultra-rare diseases and in systems incorporating external knowledge at inference time. However, all included studies were at high risk of bias, and none reported prospective clinical validation. These findings highlight the need for prevalence-stratified evaluation benchmarks and independent prospective studies before clinical deployment. Funding: This work was supported in part by the National Institutes of Health Common Fund, grant 15-HG-0130 from the National Human Genome Research Institute, U01NS134349 from the National Institute of Neurological Disorders and Stroke, R00LM014429 from the National Library of Medicine, and the Potocsnak Center for Undiagnosed and Rare Disorders.
Mossler, K.; D'Orazio, E.; Hall, K.; Osann, K.; Kimonis, V.; Quintero-Rivera, F.
Show abstract
Objective The decline of the perinatal demise rate is slowing and demises are often unexplained. Significant research has been done regarding diagnostic yield and genetic causes of demise, but little is known about how Geneticist involvement impacts outcomes. The goal of the study was to evaluate post-mortem genetic testing practices and effects of the geneticists involvement. Methods Retrospective data from 111 perinatal demise cases was examined, including rates of prenatal genetic counseling, post-delivery genetics consult, genetic testing, and autopsy investigation. Results In this cohort 54% received genetic testing and 25% received a genetics consult. When compared to those without, cases with genetic specialist involvement were associated with significant increases in testing uptake (p=0.007), diagnostic yield (p<0.001), and patient education (p<0.001). Second trimester stillbirths and those with fewer ultrasound (US) abnormalities were less likely to receive genetic testing (both p values <0.001) and consults (p<0.001, p=0.020). Conclusion Though it was not possible to avoid ascertainment bias, this data demonstrates that geneticist involvement correlates with a higher rate of testing, greater diagnostic yield, and more thorough counseling. These findings underscore the importance of integrating genetics providers into perinatal postmortem healthcare teams.
ahmed, a. K.; Rodaini, s.
Show abstract
Background: Saudi Arabia bears a disproportionate burden of autosomal recessive genetic disorders, driven by consanguineous marriage rates of 50 to 58% and elevated carrier frequencies for conditions such as sickle cell disease (carrier rate up to 25%), betathalassemia (12%), and spinal muscular atrophy (6%). The existing premarital screening program screens for only two conditions. We developed SafeGene, a computational platform that expands predictive genetic screening to 50+ conditions using region specific population genetics. Methods: SafeGene integrates five risk calculation engines: (1) Mendelian inheritance models for AR, AD, XR, and XD conditions; (2) Hardy Weinberg equilibrium based carrier probability estimation using Saudi, Gulf, and global databases; (3) a six level consanguinity coefficient calculator (F = 0 to 1/8) with risk amplification multipliers; (4) multifactorial polygenic risk models for 12 complex diseases; and (5) maternal age dependent trisomy risk curves. Built using React.js, Node.js/Express, and MongoDB with bilingual Arabic/English support. Results: The platform encompasses 50 genetic conditions across 12 categories. Validation against published Saudi data demonstrated concordance with observed disease frequencies. Economic modeling projects that expanding screening could prevent 2,800 to 4,200 affected births annually, yielding savings of SAR 1.2 to 2.8 billion ($320 to 746 million USD) per year. Conclusions: SafeGene represents a scalable, evidence-based digital health solution for comprehensive genetic screening addressing the unique population genetics of consanguineous Gulf societies. The platform is protected under pending patent applications in South Africa (CIPC) and Saudi Arabia (SAIP).
Baxter, S. M.; Singer-Berk, M.; Glaze, C.; Russell, K.; Grant, R. H.; Groopman, E.; Lee, J.; Watts, N.; Wood, J. C.; Wilson, M.; Rare As One Network, ; Rehm, H. L.; O'Donnell-Luria, A.
Show abstract
Introduction: Accurate estimation of disease prevalence is crucial for public health and therapeutic development, but traditional methods are often inaccurate. Genetic prevalence, which estimates the proportion of a population with a causal genotype, using allele frequencies from population data, offers an important alternative. Methods: We partnered with 18 Rare As One patient organizations to estimate genetic prevalence for 22 autosomal recessive conditions using population data from two releases of the Genome Aggregation Database (gnomAD). To standardize and democratize these analyses, we developed the Genetic Prevalence Estimator (GeniE), a publicly available tool, for accessible calculations. Results: Conservative carrier frequencies in gnomAD v4.1 ranged from 1/164 to 1/11,888. The median change in genetic prevalence frequency between v2.1 to v4.1 was 0.806. Partnership with patient advocacy groups provided critical real-world context that refined the interpretation of these estimates. Discussion: These findings highlight that genetic prevalence is not a static figure but a dynamic, evolving measure with important caveats that need to be considered. Our study underscores the necessity of re-evaluations as databases expand. By integrating patient-partnered insights with the GeniE platform, we empower the genomics community to maintain transparent, up-to-date, and actionable data for rare disease advocacy and drug development.
Chawla, A.; Carter, S.; Dyas, R.; Williams, E.; Moore, C.; Conyers, R.
Show abstract
Background: Pharmacogenomic testing (PGx) can optimise drug efficacy and minimise toxicity, but the extent of prescriber adherence to PGx recommendations remains unclear. We aimed to quantify clinician adherence to international genotype-guided prescribing recommendations in a cohort of paediatric oncology patients. Methods: We reviewed files of children enrolled in the MARVEL-PIC (NCT05667766) randomised control trial, who had PGx recommendations available. Patients were included if 12 weeks had passed since their PGx report was released to clinicians. Prescribing events were identified for actionable PGx recommendations, and classified as "explicitly followed", "inadvertently followed", or "not followed". Adherence was assessed by patient, drug, and recommendation. Results: 2,063 PGx recommendations were available for 216 patients. 64 (3.1%) recommendations were actionable for 44 patients and 10 drugs within the 12-week study period. Recommendations were explicitly followed in 57/288 (19.8%) of prescribing events, inadvertently followed in 145 (50.3%), and not followed in 86 (29.9%). Mercaptopurine demonstrated the highest rate of explicit adherence (87.5%). No significant associations were observed between adherence and age group, cancer type, drug type, or strength of recommendation. Conclusion: Adherence to pharmacogenomic recommendations was very low, highlighting the need to understand barriers to PGx implementation, and consideration of clinical decision supports to facilitate adherence.
Christian, S.; Belcher, T. C.; Benoit, M.; Chan, A.; Dzwiniel, T.; Ilhan, E.; Jain, S.; Katchmer, K.; Kiamanesh, O.; Lilley, M.; Marcadier, J.; Moreau, S.; Muranyi, A.; Nicolas, A.; Sharma, P.; Zhao, X.; Huculak, C.
Show abstract
Background: Mainstreaming genetic testing has emerged as a strategy to improve access and reduce wait times for patients who may benefit from genetic testing. Ensuring patients fully grasp the implications of testing when formal genetic counselling is not provided, remains a focus for ongoing research. Methods: Patients diagnosed with hypertrophic or dilated cardiomyopathy were offered genetic testing between September 2024 and September 2025 through either the mainstreaming model conducted in cardiology clinics or a referral to Medical Genetics where patients attended an online webinar or a one-on-one genetic counselling appointment. Uptake of testing, time to testing, informed choice and patient satisfaction were evaluated. Results: Among patients offered genetic testing, uptake was higher in the mainstreaming pathway (82%) compared with a referral to Medical Genetics (69%). The difference in access was predominately due to patients not following through with their Genetics referral. Mainstreaming reduced wait times where patients referred to Genetics waited a median of 94-185 additional days to be offered genetic testing. Despite improved access, only 62% of mainstreamed patients were considered informed, compared to 91% of patients that attended a patient webinar through Medical Genetics (p < 0.01). Satisfaction with decision-making was high across both pathways. Conclusion: Integrating genetic testing into cardiology practices increased access and reduced wait times; however, patients demonstrated significantly lower rates of informed decision making compared to those who attended a patient webinar offered through Medical Genetics. These findings highlight the importance of structured education to support informed decision making within mainstreaming pathways.
Cohen-Vig, L.; Munro, J. E.; Reid, J.; witkowski, T.; Sikta, N.; Kraus, D.; Bennett, M. F.; Scheffer, I. E.; Hildebrand, M. S.; Bahlo, M.; Berkovic, S. F.
Show abstract
To date, FBRSL1-related disorder has been reported in five individuals with congenital abnormalities and severe postnatal impairment with or without epilepsy; however, the full extent of the phenotypic and genotypic spectrum remains unclear. Previously reported cases involved small truncating variants apparently escaping nonsense-mediated decay, suggesting either a haploinsufficiency or a dominant-negative mechanism. We report the first case of a complex structural variant at the FBRSL1 locus, resulting in an additional, partially truncated copy of the gene, providing strong evidence for a dominant-negative mechanism. RNA-Seq supported the expression of the additional truncated gene copy. The patient is an infant girl with a profound developmental and epileptic encephalopathy (DEE). The child presented at birth with intrauterine growth restriction, respiratory insufficiency, severe swallowing dysfunction, spasticity, contractures, optic nerve hypoplasia, facial dysmorphism, and atrial septal defect. She developed severe postnatal growth restriction with microcephaly and profound developmental impairment. She has a DEE with frequent neonatal focal seizures evolving to infantile epileptic spasms syndrome (IESS). Our patient has congenital abnormalities in common with previously reported cases, along with a profound DEE, not associated previously with FBRSL1. Our case expands both the phenotypic and genotypic spectrum of FBRSL1-related disorder.
Abdelhakim, M.; Althagafi, A.; SCHOFIELD, P.; Hoehndorf, R.
Show abstract
Genotype-phenotype databases are essential for variant interpretation and disease gene discovery. Genetic variation differs among human populations, mainly in allele frequencies and haplotype patterns shaped by ancestry and demographic history. Population-specific genotypes can influence traits and disease risk; this makes population specific characterization important. Most existing resources focus on the characterization of a population's genetic background, but do not represent the resulting phenotypes. We have developed PAVS (Phenotype-Associated Variants in Saudi Arabia), a curated, publicly accessible database that integrates 5,132 Saudi clinical cases from four Saudi cohorts and 522 cases from analysis of a mixed-population cohort, together with 1,856 cases from the Deciphering Developmental Disorders study (DDD) and 9,588 literature phenopackets. Each case record describes patient-level phenotypes, encoded with the Human Phenotype Ontology (HPO), and links them to genomic variants, gene identifiers, zygosity, pathogenicity classifications, and disease diagnoses mapped to standardized disease terminologies. The data is represented in Phenopackets format and as a knowledge graph in RDF. Additionally, a web interface provides phenotype-based similarity search, gene and variant browsers, and an HPO hierarchy explorer. We evaluate the utility of the phenotype annotations for gene prioritization using semantic similarity. While there are clear differences to global literature-curated databases, phenotypes in PAVS can successfully rank the correct gene at high rank (ROCAUC: 0.89). PAVS addresses a gap in population-specific genotype-phenotype resources and provides a benchmark for phenotype-driven variant prioritization in under-represented populations.
Dario, P.
Show abstract
Variant databases ClinVar and gnomAD are the backbone of clinical variant interpretation, but their population composition is skewed toward European ancestry. Whether this skew creates systematic classification disadvantages for non-European patients with monogenic diabetes has not been examined at the database level. ClinVar variant_summary (GRCh38, April 2026; 4,421,188 variants) was cross-referenced with gnomAD v4.0 genome data for 17 monogenic diabetes genes. Annotation coverage and variant classification rates were computed stratified by genetic ancestry group (AFR, AMR, EAS, SAS, MID, NFE, FIN, ASJ). Of 14,691 gnomAD variants across the 17 genes, only 29.7% had any ClinVar classification (range: 12.7%-61.3% by gene). Among classified variants, non-Finnish European (NFE) variants had the highest variant of uncertain significance (VUS) rate (32.1%) and the lowest benign/likely benign fraction (41.6%), consistent with a large submission volume without functional follow-up. African-ancestry (AFR) variants showed the second-highest VUS rate (29.2%), not statistically distinguishable from NFE after Bonferroni correction, while all other non-European groups had significantly lower rates (all p < 0.001). GCK showed a pattern inversion - non-European VUS rate (18.5%) exceeding European (15.0%) - consistent with progressive reclassification in European populations absent in non-European cohorts. Annotation coverage and VUS divergence were uncorrelated (r = -0.15, p = 0.57). The primary equity problem is a 70% annotation gap combined with a non-European curation deficit, not a simple VUS excess. Ancestry-stratified evaluation of ClinGen Variant Curation Expert Panel (VCEP) criteria performance is warranted across disease domains.
Bylstra, Y.; Yeo Juann, M.; Teo, J. X.; Goh, J.; Choi, C.; Chan, S.; Song, C.; Chew Yin Goh, J.; Chai, N.; Lieviant, J. A.; Toh, H. J.; Chan, S. H.; Blythe, R.; Menezes, M.; Yang, C.; Hodgson, J.; Graves, N.; Sng, J.; Lim, W. W.; Law, H. Y.; Amor, D.; Baynam, G.; Chan, J. K.; Chan, Y. H.; Tan, P.; Ng, I.; Lim, W. K.; Jamuar, S. S.
Show abstract
Background As part of Singapore's effort towards precision medicine tailored to Asian diversity, we describe the implementation of a nationwide reproductive carrier screening program. Using a customised 112-gene panel, incorporating population-specific recessive genetic diseases, we outline the overall program design, and initial efforts of community and stakeholder engagement, to inform culturally appropriate implementation. Methods Participants receive culturally tailored online education regarding our reproductive screening program and are provided results with genetic counselling and reproductive options. Community and stakeholder perspectives were assessed through questionnaires and consultations with religious leaders. Results Recruitment is nation-wide, and since initiation of our pilot phase in September 2024, 1,619 couples have registered interest, with 60% uptake of those deemed eligible. Among the 456 couples that have received results to date, four couples (0.9%) were identified to be at increased risk. Community questionnaire responses (n=1002), involving couples who participated in the program as well as the general public, indicated interest is high (59%) across the cohort but awareness, intent to participate and implications for reproductive options differed by sociodemographic factors such as ancestry and religion. Healthcare professional respondents (n=113) acknowledged carrier screening will be routine in medical care, but report limited confidence and resources. Engagement with religious leaders indicated support for the program. Conclusion These early program outcomes and community engagement are guiding the implementation of expanding population-based carrier screening in Singapore, contingent on addressing practical challenges through equitable outreach and professional training.
Aziz, M. C.; Wilson, J.; Chow, C. Y.
Show abstract
PIGA-CDG is a congenital disorder of glycosylation caused by pathogenic partial loss-of-function variants in the PIGA gene. PIGA encodes an enzyme responsible for the catalytic transfer of N-acetylglucosamine to phosphatidylinositol during the first step of glycosylphosphatidylinositol anchor biosynthesis. Loss of this enzyme has a widespread phenotypic impact, but primarily results in neurological symptoms including seizures, intellectual disability, and developmental delay. Currently, treatments are limited and focus on symptom management. We developed an eye model of PIGA-CDG that has a reduced eye size. We screened a library of 98% 1,520 FDA/EMA-approved compounds to find drugs that improved the small eye phenotype. This screen revealed numerous drugs that improved eye size, including those that targeted dopamine signaling and cyclooxygenases. Using pharmacological and genetic approaches, we show that modulating dopamine signaling improves the eye size. Genetic inhibition of dopamine 2 receptor signaling and dopamine reuptake improve both the eye model and neurologically relevant PIGA-CDG phenotypes, including seizures and locomotor deficits. We also pharmacologically and genetically validate cyclooxygenase targeting drugs in the eye model. These findings reveal novel biology underlying PIGA-CDG and point towards candidate therapeutic approaches. AUTHOR SUMMARYPIGA-CDG is a rare neurodevelopmental disorder caused by pathogenic variants in the gene PIGA. Patients primarily display neurological symptoms, including seizures, developmental delay, and intellectual disability. Fewer than 100 patients have been identified, and treatment strategies are limited. In the context of rare diseases, de novo drug development is difficult due to the high cost, lengthy development times, and often too small of a patient population to conduct a clinical trial. Our lab leverages drug repurposing screening to circumvent many of the hurdles associated with de novo drug development. Here, we develop and screen FDA- or EMA-approved compounds on a Drosophila model of PIGA-CDG, uncovering novel biology underlying PIGA-associated pathophysiology. We use pharmacological and genetic tools to demonstrate that modifying dopamine signaling and abundance, as well as cyclooxygenase-mediated pathways, contribute to PIGA associated phenotypes. This work highlights promising therapeutic targets for PIGA-CDG.
Liedtke, D.; Rak, K.; Schrode, K. M.; Hehlert, P.; Chamanrou, N.; Bengl, D.; Katana, R.; Heydaran, S.; Doll, J.; Han, M.; Nanda, I.; Senthilan, P. R.; Juergens, L.; Bieniussa, L.; Voelker, J.; Neuner, C.; Hofrichter, M. A.; Schroeder, J.; Schellens, R. T.; de Vrieze, E.; van Wijk, E.; Zechner, U.; Herms, S.; Hoffmann, P.; Mueller, T.; Dittrich, M.; Bartsch, O.; Krawitz, P. M.; Klopocki, E.; Shehata-Dieler, W.; Maroofian, R.; Wang, T.; Worley, P. F.; Goepfert, M. C.; Galehdari, H.; Lauer, A. M.; Haaf, T.; Vona, B.
Show abstract
Abstract Background Understanding the phenotypic spectrum of disease-associated genes is essential for accurate diagnosis and targeted therapy. FRMPD4 (FERM and PDZ Domain Containing 4) has previously been associated with intellectual disability and epilepsy. However, its potential role in non-syndromic hearing loss has not been explored. Methods We performed genetic analysis in two unrelated families presenting with non-syndromic sensorineural hearing loss, identifying maternally inherited missense variants in FRMPD4. Clinical phenotyping included audiological assessment and evaluation for neurodevelopmental involvement. Cross-species expression analyses were conducted in Drosophila, zebrafish, and mouse. Functional characterization included quantitative evaluation of sound-evoked responses in Drosophila nicht gut hoerend (ngh) mutants, assessment of neuronal development and acoustic startle responses in zebrafish loss of function models, and morphological cochlear analyses with auditory brainstem response measurements in knockout mice. Results Three affected males from two unrelated families presented with prelingual, bilaterally symmetrical sensorineural hearing loss, with confirmed congenital onset in one individual and no evidence of neurodevelopmental abnormalities. Cross-species analyses demonstrated evolutionarily conserved expression of FRMPD4 in auditory structures. In Drosophila, quantitative analysis of sound-evoked responses in ngh mutants revealed impaired auditory function. Zebrafish loss of function models exhibited reduced neuronal populations in the otic vesicle and posterior lateral line, abnormal neuromast development, and diminished acoustic startle responses. In mice, Frmpd4 knockout resulted in high-frequency hearing loss and cochlear abnormalities consistent with the human phenotype. Conclusions Our findings expand the phenotypic spectrum of FRMPD4 to include non-syndromic sensorineural hearing loss and establish its evolutionarily conserved role in auditory function. These results have direct implications for genetic diagnosis and variant interpretation in patients with hearing loss.
Esai Selvan, M.; Gould Rothberg, B. E.; Patel, A. A.; Sang, J.; Horowitz, A.; Christiani, D. C.; Klein, R. J.; Gumus, Z. H.
Show abstract
Introduction Lung cancer is rare before age 45, and its inherited genetic basis remains poorly defined. Methods We performed whole-genome sequencing in 171 predominantly young-onset lung cancer patients and integrated these data with whole-exome sequencing from six major lung cancer consortia, yielding 9,065 patients. After quality control, analyses focused on 6,545 individuals of European ancestry, the largest ancestral group. We compared the prevalence of rare pathogenic and likely pathogenic (P/LP) germline variants between 186 young-onset (age <45 years) and 6,359 older patients at gene and gene-set levels using Fisher's exact test, stratified by histology, sex, and smoking status. Polygenic risk scores (PRS) derived from common variants were also evaluated. Results Young-onset patients carried a higher burden of rare germline P/LP variants in DNA damage response (DDR) genes (including BRIP1, ERCC6, MSH5), and in cilia-related genes, notably GPR161. At the pathway level, DDR genes were significantly enriched (OR=1.66, p=0.007), with the strongest signal in the Fanconi Anemia pathway and among females (OR=1.96, p=0.01). Enrichment was also observed in inborn errors of immunity pathways, with strongest signals in antibody deficiency and the complement system genes. Young-onset patients additionally exhibited higher lung cancer PRS. Conclusion Young-onset lung cancer exhibits a distinct germline genetic architecture, characterized by enrichment of rare P/LP variants in DDR, cilia-related, and immune pathways, and an elevated lung cancer PRS. These findings support a greater role for inherited susceptibility in early-onset disease and have implications for risk stratification, earlier screening, and precision prevention.
Vergara, C.; Ni, Z.; Zhong, J.; McKean, D.; Connelly, K. E.; Antwi, S. O.; Arslan, A. A.; Bracci, P. M.; Du, M.; Gallinger, S.; Genkinger, J.; Haiman, C. A.; Hassan, M.; Hung, R. J.; Huff, C.; Kooperberg, C.; Kastrinos, F.; LeMarchand, L.; Lee, W.; Lynch, S. M.; Moore, S. C.; Oberg, A. L.; Park, M. A.; Permuth, J. B.; Risch, H. A.; Scheet, P.; Schwartz, A.; Shu, X.-O.; Stolzenberg-Solomon, R. Z.; Wolpin, B. M.; Zheng, W.; Albanes, D.; Andreotti, G.; Bamlet, W. R.; Beane-Freeman, L.; Berndt, S. I.; Brennan, P.; Buring, J. E.; Cabrera-Castro, N.; Campa, D.; Canzian, F.; Chanock, S. J.; Chen, Y.;
Show abstract
Pancreatic cancer disproportionately affects Black individuals in the United States, but they have limited representation in genetic studies of pancreatic ductal adenocarcinoma (PDAC). To address this gap, we performed admixture mapping and genome-wide association analysis (GWAS) in genetically inferred African ancestry individuals (1,030 cases and 889 controls). Admixture mapping identified three regions with a significantly higher proportion of African ancestry in cases compared to controls (5q33.3, 10p1, 22q12.3). GWAS identified a genome-wide significant association at 5p15.33 (CLPTM1L, rs383009:T>C, T Allele Frequency=0.51, OR:1.45, P value=1.24x10-8), a locus previously associated with PDAC. Known loci at 5p15.33, 7q32.3, 8q24.21 and 7q25.1 also replicated (P value <0.01). Multi-ancestral fine-mapping identified two potential causal SNPs (rs3830069 and rs2735940) at 5p15.33. Collectively these findings identified novel PDAC risk loci and expanded our understanding of this deadly cancer in underrepresented populations, emphasizing the multifactorial nature of PDAC risk including inherited genetic and non-genetic factors. Statement of SignificanceTo understand how genetic variation contributes to PDAC risk in Black people in North American, we studied individuals of genetically-inferred African ancestry. We identified novel risk loci and differences in the contribution of known loci. This demonstrates that ancestry-informed genetic analyses improve our understanding of PDAC risk and enhances discovery.
Raspin, K.; Bartlett, L.; Makin, J.; Wilson, R.; Butorac, K.; Roydhouse, J.; Dickinson, J. L.
Show abstract
BACKGROUND: Prostate cancer (PrCa) is the most commonly diagnosed cancer in men in many countries and is the most heritable of the common cancers. Precision medicine approaches to disease management are not routinely available to most men, yet we know that germline genetic testing can help identify those at high-risk of developing advanced or lethal disease and can influence selection of therapeutics. An integral part of healthcare delivery design is the inclusion of patients/consumers in the development of frameworks for managing health interventions that are tailored to meet their needs. METHODS: In Phase I, we undertook focus group discussions with men previously diagnosed with PrCa (n=20), to determine their opinions, perceptions and expectations of germline genetic testing for PrCa. Focus groups were tape-recorded, transcribed verbatim, coded and then thematically analysed using NVivo. In Phase II, themes were then used to design and development a Precision Medicine in Prostate Cancer Information Toolkit, which was reviewed by patients (n=14), their carers/family members (n=4) and healthcare providers (n=14). RESULTS: In Phase I, knowledge about precision medicine and genetic testing was generally low. The strongest motivation for undertaking testing was to identify family members' risk levels (n=7), and the biggest concern pertained to insurance discrimination (n=5). Phase II data revealed that generally healthcare providers (n=8) found the purpose of the toolkit to be clearer than patients (n=5). Though, patients found the task of imagining the usefulness of the toolkit at the time of diagnosis or beforehand when assessing genetic risk, quite difficult. Participants highlighted that information regarding life insurance, implications for their family and costs associated with testing were of concern. CONCLUSIONS: This study has revealed critical knowledge gaps, preferred communication/support needs, and concerns/risks associated with germline genetic testing in PrCa. Concerns pertaining to family members and insurance discrimination are obvious topics that need to be addressed. Our toolkit may be helpful in addressing knowledge gaps, but further testing is needed to ensure its accessibility across literary and cultural contexts.
Bolmer, E.; Schmidt, P.; Fischer, I.; Rassmann, S.; Ruder, A.; Hustinx, A.; Kirchhoff, A.; Beger, C.; Skaf, K.; Fardipour, M.; Hsieh, T.-C.; Keller, A.; De Rosa, A.; Kalantari, S.; Sirchia, F.; Kotnik, P.; Born, M.; Solomon, B. D.; Waikel, R. L.; Tkemaladze, T.; Abashishvili, L.; Melikidze, E.; Sukhiashvili, A.; Lartsuliani, M.; Nevado, J.; Tenorio, J.; Juergens, J.; Lindschau, M.; Lampe, C.; Moosa, S.; Pantel, J. T.; Mattern, L.; Elbracht, M.; Luk, H.-M.; Travessa, A.; De Victor, J.; Alhashim, M.; Alhashem, A.; AlKaabi, N.; Kocagil, S.; Akbas, E.; Kornak, U.; Rohrer, T.; Pfaeffle, R.; Soucek,
Show abstract
Background: Diagnosing the over 700 known rare bone diseases (RBDs) is inherently challenging and often requires extensive time and multiple clinical visits. Effective treatment, particularly for RBDs with approved therapies, depends on early and precise identification of the specific RBD type. Image recognition artificial intelligence (AI) has the potential to significantly enhance diagnostic processes and improve patient outcomes. Many of these disorders cause characteristic skeletal changes, especially in the hands, and are associated with growth abnormalities. Consequently, affected children routinely undergo hand radiographs for bone age assessment, making these images a widely available yet underutilized diagnostic resource. Materials and Methods: We retrospectively compiled 5,623 multi-institutional hand radiographs from 2,471 patients with 45 different RBDs and 1,382 unaffected controls. We trained two deep learning models: a binary classifier to differentiate between RBD and non-RBD hand radiographs, and a multi-class classifier covering ten RBDs (or RBD groups), using 5-fold cross-validation. Preprocessing included masking, normalization, and data augmentation. Additionally, we applied occlusion sensitivity mapping to visualize class-specific features and evaluated the learned representations through cosine-based retrieval and UMAP projections of the feature space. Results: The affected versus unaffected classifier achieved a balanced accuracy of 85.5% on the test dataset. The ten-class classifier reached a balanced (top-1) accuracy of 76.6%, with top-3 accuracy exceeding 90%. Disorders with highly distinctive phenotypes, such as achondroplasia, achieved accuracies above 95%, whereas phenotypically overlapping disorders, such as ACAN- and SHOX-related short stature, were more frequently confused. Feature space analysis showed that validation samples clustered closely with their respective training distributions, supporting the consistency and generalizability of the learned embeddings. Conclusion: This manuscript presents a proof of principle for the development of Bone2Gene, a next-generation phenotyping (NGP) tool for the detection and differential diagnosis of RBDs, currently based on hand radiographs. Ongoing efforts focus on expanding the dataset to include additional RBDs or RBD groups in the current multi-class classifier for differential diagnosis and to further evaluate its generalizability. The Bone2Gene study is open to collaboration.
Alonso-Gonzalez, A.; Jaspez, D.; Lorenzo-Salazar, J. M.; Delgado, A.; Quintero-Bacallado, A.; Ma, S.-F.; Strickland, E.; Mychaleckyj, J.; Kim, J. S.; Huang, Y.; Adegunsoye, A.; Oldham, J. M.; Maher, T. M.; Guillen-Guio, B.; Wain, L. V.; Allen, R. J.; Saini, G.; Jenkins, R. G.; Molina-Molina, M.; Zhang, D.; Kim Garcia, C.; Martinez, F. J.; Noth, I.; Flores, C.
Show abstract
Background: Idiopathic pulmonary fibrosis (IPF) is a rare disease with a poor prognosis. Disease risk involves rare and common genetic variants. However, an inverse association have been described between them. Accordingly, IPF patients with a higher polygenic risk score (PRS) for IPF are less likely to carry rare deleterious variants and vice versa. Here, we evaluate weather PRS of IPF could serve as an additional criterion to patient prioritisation for rare variant discovery. Methods: We identified carriers based on the presence of rare qualifying variants (QVs) in genes linked to monogenic forms of pulmonary fibrosis in 888 IPF patients from the Pulmonary Fibrosis Foundation Patient Registry (PFF-PR). Genome-wide association study (GWAS) summary statistics from independent cohorts were used to construct a whole-genome PRS (WG-PRS) using a clumping and thresholding method (C+T) and a Bayesian method (SBayesRC). PRS were also derived from 19 known common sentinel IPF variants (Sentinel-PRS). Logistic regression models were used to evaluate associations between PRS and carrier status. Discriminatory performance was evaluated using area under the curve (AUC) analysis, and comparisons were made with DeLong test. Validation was performed in 472 IPF individuals from the UK PROFILE cohort. Results: IPF-PRS were strongly associated with the QVs carrier status: Odds Ratio [OR] 0.65 (95% Confidence Interval [CI] 0.53-0.79) for WG-PRSC+T, OR 0.71 (95% CI 0.59-0.86) for WG-PRSSBayesRC, and OR 0.77 (95% CI 0.63-0.94) for Sentinel-PRS. Adding WG-PRS to the patient personal clinical history improved the prediction of QVs carriers: AUC=0.62 for the clinical model, AUC=0.68 for WG-PRSC+T (DeLong test, p=9.54x10-4) and AUC=0.66 for WG-PRSSBayesRC (DeLong test, p=0.02). Adding of IPF-PRS to clinical variables correctly reclassified 22.8% of carriers when using WG-PRSC+T, 20.8% when using Sentinel-PRS, and 16.7% for WG-PRSSBayesRC. WG-PRSSBayesRC and the Sentinel-PRS also demonstrated improved prediction of QVs carriers in telomere-related genes in PROFILE. Conclusions: Incorporating IPF-PRS into a model based on the patient clinical history improves the identification of QVs carriers. Although the overall discriminatory power was moderate, these findings raise de the possibility of using WG-PRS as useful criterion for rare variant discovery in patients with IPF and enhance decision-making.